Testing Optimality of Sequential Decision-Making
Authors
Abstract
This paper provides a statistical method to test whether a system that performs a binary sequential hypothesis test is optimal in the sense of minimizing the average decision time while taking decisions with given reliabilities. The proposed method requires samples of the decision times, the decision outcomes, and the true hypotheses, but does not require knowledge of the statistics of the observations or of the properties of the decision-making system. The method is based on fluctuation relations for decision time distributions, which are proved for sequential probability ratio tests. These relations follow from the martingale property of probability ratios and hold under fairly general conditions. We illustrate these tests with numerical experiments and discuss potential applications.

This work has been partly supported by the German Research Foundation (DFG) within the Cluster of Excellence EXC 1056 ’Center for Advancing Electronics Dresden (cfaed)’ and within the CRC 912 ’Highly Adaptive Energy-Efficient Computing (HAEC)’. The material in this paper has been presented in part at the IEEE International Symposium on Information Theory (ISIT), Aachen, Germany, June 2017 [1].

M. Dörpinghaus is with the Vodafone Chair Mobile Communications Systems and with the Center for Advancing Electronics Dresden (cfaed), Technische Universität Dresden, 01062 Dresden, Germany.

Izaak Neri is with the Max-Planck-Institute for the Physics of Complex Systems, Dresden, Germany, with the Max-Planck-Institute of Molecular Cell Biology and Genetics, Dresden, Germany, and with the Center for Advancing Electronics Dresden (cfaed), Technische Universität Dresden, 01062 Dresden, Germany.

Édgar Roldán is with the Max-Planck-Institute for the Physics of Complex Systems, Dresden, Germany, with the Center for Advancing Electronics Dresden (cfaed), Technische Universität Dresden, 01062 Dresden, Germany, and with GISC – Grupo Interdisciplinar de Sistemas Complejos, Madrid, Spain.

H. Meyr is an emeritus of the Institute for Integrated Signal Processing Systems, RWTH Aachen University, 52056 Aachen, Germany and is now a grand professor of the Center for Advancing Electronics Dresden (cfaed) at Technische Universität Dresden, 01062 Dresden, Germany.

Frank Jülicher is with the Max-Planck-Institute for the Physics of Complex Systems, Dresden and with the Center for Advancing Electronics Dresden (cfaed), Technische Universität Dresden, 01062 Dresden, Germany.

January 8, 2018 DRAFT. arXiv:1801.01574v1 [cs.IT] 4 Jan 2018
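As an illustration of the kind of data the abstract says the method consumes (decision times, decision outcomes, and true hypotheses), the following is a minimal sketch of a Wald sequential probability ratio test on Gaussian observations. It is not the paper's optimality test. The Gaussian model, the parameters `mu`, `sigma`, and `alpha`, and the function names `sprt_gaussian` and `collect_samples` are assumptions made for this example only.

```python
import math
import random


def sprt_gaussian(true_hypothesis, mu=0.5, sigma=1.0, alpha=0.05, rng=random):
    """Run one SPRT on i.i.d. Gaussian observations (illustrative assumption).

    H1: observations ~ N(+mu, sigma^2); H2: observations ~ N(-mu, sigma^2).
    Returns (decision_time, decision) with decision in {1, 2}.
    """
    # Symmetric Wald thresholds on the cumulative log-likelihood ratio.
    upper = math.log((1.0 - alpha) / alpha)
    lower = -upper
    mean = mu if true_hypothesis == 1 else -mu
    llr = 0.0
    n = 0
    while lower < llr < upper:
        x = rng.gauss(mean, sigma)
        # log[p1(x) / p2(x)] for N(+mu, sigma^2) versus N(-mu, sigma^2).
        llr += 2.0 * mu * x / (sigma ** 2)
        n += 1
    return n, (1 if llr >= upper else 2)


def collect_samples(num_trials=2000, seed=0):
    """Collect (decision_time, decision, true_hypothesis) triples."""
    rng = random.Random(seed)
    samples = []
    for _ in range(num_trials):
        hypothesis = rng.choice((1, 2))
        time, decision = sprt_gaussian(hypothesis, rng=rng)
        samples.append((time, decision, hypothesis))
    return samples


if __name__ == "__main__":
    data = collect_samples()
    errors = sum(1 for _, d, h in data if d != h)
    mean_time = sum(t for t, _, _ in data) / len(data)
    print(f"empirical error rate: {errors / len(data):.3f}")
    print(f"mean decision time:   {mean_time:.2f}")
```

Each triple records only what an outside observer of the decision-maker could log, which matches the abstract's point that no knowledge of the observation statistics is needed at the testing stage.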
Similar resources
Comparison Analysis of the Wald's and the Bayes Type Sequential Methods for Testing Hypotheses
A comparison analysis of Wald's and Bayes-type sequential methods for testing hypotheses is offered. The merits of the new sequential test are: universality which consists in optimality (with given criteria) and uniformity of decision-making regions for any number of hypotheses; simplicity, convenience and uniformity of the algorithms of their realization; reliability of the obtained resul...
Optimizing Red Blood Cells Consumption Using Markov Decision Process
In healthcare systems, one of the important tasks relates to perishable products such as red blood cell (RBC) units, whose consumption management over different periods can contribute greatly to the optimality of the system. In this paper, the main goal is to enhance the ability of the medical community to organize the consumption of RBC units so as to deliver unit orders in a timely manner, with a focus ...
Convergence in a sequential two stages decision making process
We analyze a sequential decision making process, in which at each step the decision is made in two stages. In the first stage a partially optimal action is chosen, which allows the decision maker to learn how to improve it under the new environment. We show how inertia (cost of changing) may lead the process to converge to a routine where no further changes are made. We illustrate our scheme with some...
Sequential Decision Making with Rank Dependent Utility: A Minimax Regret Approach
This paper is devoted to sequential decision making with Rank Dependent expected Utility (RDU). This decision criterion generalizes Expected Utility and makes it possible to model a wider range of observed (rational) behaviors. In such a sequential decision setting, two conflicting objectives can be identified in the assessment of a strategy: maximizing the performance viewed from the initial state (opti...
On Sequential Optimality Conditions without Constraint Qualifications for Nonlinear Programming with Nonsmooth Convex Objective Functions
Sequential optimality conditions provide adequate theoretical tools to justify stopping criteria for nonlinear programming solvers. Here, nonsmooth approximate gradient projection and complementary approximate Karush-Kuhn-Tucker conditions are presented. These sequential optimality conditions are satisfied by local minimizers of optimization problems independently of the fulfillment of constrai...
Matrix Sequential Hybrid Credit Scorecard Based on Logistic Regression and Clustering
The Basel II Accord pointed out benefits of credit risk management through internal models to estimate Probability of Default (PD). Banks use default predictions to estimate the loan applicants’ PD. However, in practice, PD is not useful and banks applied credit scorecards for their decision making process. Also the competitive pressures in lending industry forced banks to use profit scorecards...
Journal: CoRR
Volume: abs/1801.01574
Publication date: 2018